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Abstract 

An open quantum system (i.e., one that interacts with its environment) is al- 
most always entangled with its environment; it is therefore usually not attributed 
a wave function but only a reduced density matrix p. Nevertheless, there is a 
precise way of attributing to it a wave function rpi, called its conditional wave 
function, which is a random wave function of the system whose probability distri- 
bution [i\ depends on the entangled wave function ip S M\ <g> 3%2 in the Hilbert 
space of system and environment together. We prove several universality (or typ- 
icality) results about they show that if the environment is sufficiently large 
then hi does not depend much on the details of ip and is approximately given 
by one of the so-called GAP measures. Specifically, for most entangled states ij) 
with given reduced density matrix pi, p,i ls close to GAP(pi). We also show that, 
if the coupling between the system and the environment is weak, then for most 
entangled states ip from a microcanonical subspace corresponding to energies in a 
narrow interval [E, E + 5E] (and most bases of Jify), A*i is close to GAP(pp) with 
pp the canonical density matrix on at inverse temperature f3 = f3(E). This 
provides the mathematical justification of the claim that GAP(pp) is the thermal 
equilibrium distribution of ipi. 

Key words: Gaussian measures, GAP measures, Haar measure on the unitary 
group, thermodynamic limit, canonical ensemble in quantum mechanics, typicality 
theorems, conditional wave function, typical wave function. 
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1 Introduction 

In this paper we establish the universality of certain probability distributions on Hilbert 
spaces known as GAP measures [H [19] . This makes precise some statements and math- 
ematical considerations outlined in our earlier paper [8] on the thermal equilibrium 
distribution of the wave function of an open quantum system. 

By saying that GAP measures are universal we mean that the distributions fi± (de- 
scribed below) are typically close to a GAP measure, 

/it w GAP (1) 

when the system's environment is sufficiently large. To illustrate the terminology of 
universality, one can say that the central limit theorem conveys a sense in which the 
Gaussian probability distribution on the real line is universal: many physically relevant 
probability distributions are approximately Gaussian. Instead of universality, one also 
often speaks of typicality; we use these two terms more or less interchangeably. 

The family of GAP measures is a family of probability measures on Hilbert spaces. 
There is one GAP measure for every density matrix p in a Hilbert space Jif, denoted 
GAP(p); it is concentrated on the unit sphere in Jjf, 

S(jr) = {i/> e St? : \\i/>\\ = 1} . (2) 
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The density matrix of GAP(p) is p, in the usual sense that, for any probability measure 
jj, on E>(Jif), its density matrix is 



p,= / (3) 

JS(JT) 

which is also the covariance matrix of /i provided has mean zero. The GAP measures 
relevant to thermal equilibrium are those associated with canonical density matrices 

P/^e"^, (4) 

where Z = tre~^ H is the normalization constant, (3 the inverse temperature and H the 
Hamiltonian. Detailed discussions of GAP measures and their physical applications can 
be found in [SJ [19]. See (22] for a study about the support of GAP measures, that is, 
about what GMP^^-distributed wave functions typically look like. 

The main application of GAP measures is the characterization of the wave functions 
of systems we encounter in nature. In most cases we do not know a system's wave 
function, for example because it is a photon coming from the sun (or another star, or 
the cosmic microwave background, or a lamp), or because it is an electron that has 
escaped from a piece of metal. But in many cases the system is more or less in thermal 
equilibrium, and then, according to the considerations presented in [8] and here, its wave 
function should be GAP distributed. 



1.1 Conditional Wave Function 

Consider a composite quantum system consisting of two subsystems, system 1 and sys- 
tem 2, with associated Hilbert spaces J#i and Suppose that the system is in a pure 
state ip G J^totai — ® We ask what might be meant by the wave function of 
system 1. An answer is provided by the notion of conditional wave junction, defined 
as follows [8]£i] Let b = {bj} be an orthonormal basis of For each choice of j, the 
partial inner product (bj\ip), taken in J%2, is a vector belonging to M\. Regarding j as 
random (and therefore writing J), we are led to consider the random vector ipi G M\ 
given by 

*. = T^T (6) 

ikm^ii 1 ' 

where bj is a random element of the basis {bj}, chosen with the quantum distribution 

= j) = \Ubj\M\ (7) 



1 This definition is inspired by Bohmian mechanics, a formulation of quantum mechanics with particle 
trajectories, where the (non-normalized) conditional wave function tpi of system 1 is defined as [4] 

ik(x)=il>(x,Y) (5) 

for x in the configuration space of system 1, with Y the actual configuration of system 2. 
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We refer to ipi as the conditional wave function of system l|§ 

The distribution of ipi corresponding to and (j7|) is given by the following proba- 
bility measure on §(J#f): The probability that ipi E A C §(Jgf) is 

Hi{A) = pf b (A) = e A) = Y,U b M\ 2 ^ M/mm \(A) (9) 

-^HM)- (10) 

where 5$ denotes the Dirac "delta" measure (a point mass) concentrated at and 1^ 
denotes the characteristic function of the set A. While the density matrix p w associated 
with Hi always equals the reduced density matrix pf of system 1, given by 

pf = tr 2 |^)(^| = ^(6 J #)(^|6,), (11) 
j 

the measure pi itself usually depends on the choice of the basis b, so pi = pf ,b . 



1.2 Summary of Results 

In this paper, we prove several universality theorems about GAP measures, Theorems [T|- 
IU formulated in Section |2j These are statements to the effect that for most wave 
functions ip from relevant subsets of J&i ® J&2 and/or most ortho normal bases b of J^, 
ipi is approximately GAP-distributed. Here, "most" means that the set of exceptions is 
small with respect to the appropriate natural uniform measure. 

The basic universality property is expressed in Theorem [TJ which asserts that for 
sufficiently large dim J#2, for any orthonormal basis b of J#2, and for any density matrix 
Pi on J%i, most ip in §(J^ <g> M^) with the reduced density matrix tr 2 = Pi are 

such that the distribution pf' b of ipi is arbitrarily close to GAP(pi), 

pf b » GAP(pi) . (12) 

This fact was derived (but not rigorously proven) in Section 5.1.3 of [8]. The rigorous 
proof of Theorem [1] is based on the fact, found independently by several authors [2l"| 

2 The conditional wave function can be regarded as a precise version of the "collapsed" wave function 
in the standard quantum formalism: Suppose that system 1 has interacted with system 2, and their 
joint wave function, as produced by the appropriate Schrodinger evolution, is now 

X^Wf. (8) 

3 

where the Cj are complex coefficients and all tps are normalized. If system 2 is a macroscopic system 

(2) 

and the t/y s are macroscopically different states then in the standard formalism one regards j as 
random with distribution \cj\ 2 , and says accordingly that system 1 can be attributed the "collapsed" 
wave function ip^ with probability |cj| 2 . The conditional wave function of system 1, according to the 
above definition in the case that the tp^s are among the {bj}, is indeed ip^ with probability \cj\ 2 . 
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[23| [31 [21 [15] (see also [HI [251 Ell US]), that for a random n x n unitary matrix with 
distribution given by the Haar measure on the unitary group U(n), the upper left (or any 
other) k x k submatrix, multiplied by a normalization factor y/n, converges as n — > oo to 
a matrix of independent complex Gaussian random variables with mean and variance 
1. (To understand the factor y/n, note that a column of a unitary n x n matrix is a unit 
vector, and thus a single entry should be of order l/s/n.) 

Theorem [2] asserts that the conclusion of Theorem [T] — that (j!2p holds with arbitrary 
accuracy for sufficiently large dim Jif 2 — is also true for every i/> with tr 2 — Pi f° r 

most h (instead of for every b for most ip). 

Theorems [3] and H] justify the physical conclusion that, if a system (system 1) is 
weakly coupled to a very large (but finite) second system (the "heat bath," system 2) 
then, for most wave functions of the composite system with energy in a given narrow 
energy range [E, E + 5E], the conditional wave function of the system is approximately 
GAP-distributed for most orthonormal bases of the heat bath. In more detail, let the 
interaction between the two systems be negligible so that the Hamiltonian can be taken 
to be 

H = Hi®I 2 + h®H 2 (13) 

(with l\j2 the identity operator on Mx/q), and let M'r C <8> ^2 be a micro-canonical 
energy shell of the composite system, i.e., the subspace spanned by the eigenstates of 
the total energy with eigenvalues in [E, E + 5E}. Assume that the eigenvalues of H 2 are 
sufficiently dense and that the dimensions of ^> and j#r are sufficiently large. Then, 
for most ip G §(J%), 

pf b » GAP(pp) (14) 

for most bases b of Jt^; here, pp is the canonical density matrix 03]) and = /3(E) . 

In Theorems [3] and H] we relax the condition that ip have a prescribed reduced density 
matrix, and exploit instead canonical typicality. This is the fact, found independently 
by several groups El dH HZ] and anticipated long before by Schrodinger [2TJ, that for 
most ip G S(J^r), the reduced density matrix tr 2 \i/>)(ip\ is approximately of the canonical 
form (j3J) . More generally, in Theorems [3] and H] we may regard as any subspace of 
J%i <8> of sufficiently high dimension. Canonical typicality then refers to the fact that 
for most ip G S(J$?r), tr 2 \ip)(i/j\ is close to tr 2 Pr, where pr denotes 1/ dimJrffn times the 
projection to J%r; the precise version of canonical typicality that we use in the proof of 
our Theorems [3] and H] is due to Popescu, Short, and Winter [T51[T7]. Theorem [3] asserts 
that for most if) G S(J£r), 

pf b « GAPfJg) , (15) 

with Pr = tr 2 pr, for most bases b of M%. 

Theorem H] is a very similar statement but differs in the detailed meaning of 
and refers to a fixed density matrix, such as pp, in place of p^ in ( fT5l) . 

1.3 Remarks 

• Time evolution. It may be interesting to consider how pf ,b evolves with time if 
the wave function ip — ipt of systems 1 and 2 together evolves according to the 
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Schrodinger equation 

ih^ = H^ t . (16) 

In a situation in which most ip £ S(J^r) have pf' b ~ GAP(p^), we may expect 
that even for ipo £ E>(J4?r) with /^f°' b far from any GAP measure, pi(t) = /if*' 6 will 
approach GAP(p^) and stay near GAP(p R V> ) most of the time (though not forever, 
as follows from the recurrence property (almost-periodicity) of the Schrodinger 
evolution in a finite-dimensional Hilbert space). We leave this problem open but 
briefly remark that one can already conclude by interchanging the time average 
and the average over ip that whenever it is true for most ip £ §(J%) that pf ,b « 
GAP(p^), then for most ^ e S(J^), //f' 6 « GAP(p { £) for most times t; the 
open problem is to prove a statement that concerns all, rather than most, ipo. 

• The role of interaction. Another remark concerns the role of interaction (between 
the system and the heat bath) for obtaining the distribution GAP(pp). The na- 
ture of the interaction is relevant to our discussion in two places — although our 
theorems do not depend on it, as they do not mention the Hamiltonian at all. 
First, interaction is relevant for creating typical wave functions, as it helps evolve 
atypical wave functions into typical ones. This is closely related to the fact that 
a system coupled to a heat bath (i.e., a big second system) will typically go from 
non-equilibrium to thermal equilibrium only in the presence of interaction; see 
Section 4 of [7] for further discussion and examples. Second, it depends on the 
interaction which subspace of Jtf[ ® is the micro- canonical energy shell that 
we want ^r to be, and thus also which density matrix tr 2 Pr is. In the limit of 
negligible interaction, tr 2 Pr has the canonical form pp = (1/ 'Z)e~ l3H , while inter- 
action makes it deviate from this form. As a consequence of these two roles, when 
we want to obtain from non-equilibrium a wave function ip £ Jjtf[ <g> J%2 such that 
the distribution of the conditional wave function tpi is close to GAP(pp), we may 
want that the interaction be not too large (or else there will be deviations from 
pp) and that the interaction be not too small (or else it may take too long, say 
longer than the present age of the universe, to reach thermal equilibrium). 



1.4 Definition of the GAP Measure 

For any density matrix p on M' , the measure GAP(p) on (the Borel u-algebra of) E>(Jt?) 
is built by starting from the measure G(p), which is the Gaussian measure on J$? with 
mean and covariance matrix p. In this paper, we are interested only in the case 
dim Jtf? < oo. Then G(p) can be explicitly defined as follows: Let S be the subspace 
of Jtf? on which p is supported, i.e., its positive spectral subspace, or equivalently the 
orthogonal complement of its kernel, or equivalently its range; let d! = dim S and p + the 
restriction of p to S; then G(p) is the measure on Jf supported on S with the following 
density relative to the Lebesgue measure A on S: 

= J, }, exp(-Mp;V))- (17) 
dX n d det p + 
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Equivalently, a G(p)-distributed random vector if) is one whose coefficients (Xi\ip) relative 
to an eigenbasis {xi} of p (i.e., px% = PiXi with < Pi < 1) are independent complex 
Gaussian random variables with mean and variances E|(xj|-?/>)| 2 = pf, by a complex 
Gaussian random variable we mean one whose real and imaginary parts are independent 
real Gaussian random variables with equal variances. 
Noting that 

/ G(p)(^)|H| 2 = trp = l, (18) 
J, ^ 

we now define the adjusted Gaussian measure GA(p) on as: 

GA{p)m = \\n 2 G{ P )m- m 

If i\) GA is a GA(p)-distributed vector, then GAP(p) is the distribution of this vector 
projected on the unit sphere; that is, GAP(p) is the distribution of 



r^GAP = _r _ (2Q) 



^GA 
J^GA 



Like G(p) and unlike GA(p), GAP(p) has covariance matrix p. 

More generally, one can define for any measure p on J>$f the "adjust-and-project" 
procedure. We denote by Ap the adjusted measure 

MW) = U\\ ■ (2i) 

The projection on the unit sphere is defined as: 

P : ^T\{0} -)-S(^r), P(^) = ^-. (22) 



Then the adjusted-and-projected measure is P*(Ap) = Ap o P _1 , where P* denotes the 
action of P on measures, thus defining a mapping P„oi from the measures on Jrff with 
J p(dip) ||-^|| 2 = 1 to the probability measures on §(Jtf). 



2 Results 

2.1 GAP Measure From a Typical Wave Function of a Large 
System, Given the Reduced Density Matrix 

Let J^totai = >ffi\ ® <fflii where J£f and have respective dimension d\ and 0I2, with 
c?i < d 2 < 00. For any given density matrix p 1 on J#i, let 

^(pi) = G §(^ otal ) : pf = Pl } (23) 

be the set of all normalized wave functions in J^totai with reduced density matrix pf — p 1 . 
We will see that M(pi) is always non-empty. 
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Theorem [I] below concerns typical wave functions in M(p{), i.e., typical wave func- 
tions with fixed reduced density matrix. The concept of "typical" refers to the uniform 
distribution u pi on M{p\)\ an explicit definition of this distribution will be given in 
Section 13.11 

Before we formulate Theorem [TJ we introduce some notation. First, for any Hilbert 
space J^, let @(J4?) denote the set of all density operators on i.e., of all positive 
operators on J$? with trace 1. Second, when p is a measure on jriP or §>(J$?) and f(ip) 
is a measurable function on Jf or &(Jf?) then we use the notation 

p(f) := J M#)/W • (24) 

Third, let ||/||oo = sup^ 

Theorem 1. For every 0<e<l ; 0<5<l, and dx G N, there is D 2 = D 2 (e, 5, dx) > 
such that for all ^eN with d 2 > D 2 , for every M{ and ^ 2 with dimM[/ 2 = dx/ 2 , for 
every orthonormal basis b = {bx, ■ ■ ■ ,bd 2 } of ,W 2 , for every px G and for every 

bounded measurable function f : S(J(f[) — > R ; 

u pi [i, G M(px) : \pf\f) - GAP( Pl )(f)\ < £ ll/Hoo} > 1 - . (25) 

We give the proof, as well as those of Theorems EHU in Section [3j 
It follows from Theorem [1] that, for every sequence (^2 )n )neN of Hilbert spaces with 
d 2 ,n = dimJ^2 n — > oo as n — > oo and every sequence (b n ) neN of orthonormal bases 
b n = {bx,n, ■ ■ ■ j frd 2 ,n,n} °f ^2,n, for every px G @(J?x), and for every bounded measurable 
function / : S(J^i) —> R, the sequence of random variables Px"' bn (f), where \l/ n has 
distribution w pi on §(j£f <8> ^2,n), converges in distribution, as n — > oo, to the constant 
GAP(px)(f), in fact uniformly in p ly b n and those / with \\f\\oo < 1- Because of 
the convergence for every /, we can say that the sequence of random measures p± n,bn 
converges "weakly in distribution" to the fixed measure GAP(px). 

A few comments about notation. In j8], dx was called k, d 2 was called m, and the 
notation for the basis {bx, ■ ■ ■ , bd 2 } was {|1), . . . , \m)}. For enumerating the basis, we will 
use the letter j, and thus write bj\ in [8], the notation was q 2 for j (subscript 2 because 
it refers to Jif 2 ). For a random choice of j, we write J; the corresponding notation in 
[8J was Q 2 . 

2.2 GAP Measure From a Typical Basis of a Large System 

As already explained in [8], instead of considering a typical wave function and a fixed 
basis one can consider a fixed wave function and a typical basis. Let ON B(Jtf 2 ) be the 
set of all orthonormal bases of J^2, and recall the notation pf = tr 2 

Theorem 2. For every < e < 1, < 5 < 1, dx G N, d 2 G N wi/i d 2 > D 2 (e,5,dx) 
as in Theorem^ every J%x and with dim = dx/ 2 , every ip G S(J#{ ® ^2), and 
every bounded measurable function f : S(Jffi) — >• M, 

mojvb{ (61, • • • , b d2 ) G OiV5(^ 2 ) : \pf\f) - GAP(pf)(f)\ < e \\f\u] > 1 - 5, (26) 
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where uonb is the uniform probability measure on ON 'B(J^) , corresponding to the Haar 
measure on the unitary group U(J^). 



2.3 GAP Measure From a Typical Basis and a Typical Wave 
Function in a Large Subspace 

In our main physical application, the reduced density matrix pf is not fixed, although — 
by a fact known as canonical typicality — most of the relevant ^s have a reduced density 
matrix pf that is close to a certain fixed density matrix, for example to the canonical 
density matrix pp = (l/Z)e~^ H . In this section, we present two further universality 
theorems that are appropriate for such situations, in which the relevant set of ips is a 
subspace of M{ ® that will be denoted Mr. 

The physical setting to have in mind is this. A system with Hilbert space M{ is 
entangled with a large system whose Hilbert space is M^. The Hamiltonian H is thus 
defined on J^otai — ^\ ® ^2; suppose the total system is confined to a finite volume, so 
that H has pure point spectrum. Let [E, E + SE] be a narrow energy window, located 
at a suitable energy E such as one corresponding to a more or less fixed energy per 
particle or per volume. Then the micro-canonical energy shell is the spectral subspace 
of H associated with this interval, i.e., the subspace spanned by the eigenvectors with 
eigenvalues between E and E + SE, and this is our subspace Jif R . The micro- canonical 
density matrix p R is the density matrix associated with i.e., 1/ dim times the 
projection to Canonical typicality then asserts that for most wave functions in 

S(J^r), the reduced density matrix is approximately pp for an appropriate value of (3. 

For more general Jf? R , canonical typicality means that for most ip £ S(J^j), the 
reduced density matrix pf is close to tr 2 p R . The precise statement that we make use 
of is Theorem 1 of [TH] or the "main theorem" of [T7], which asserts, in a somewhat 
specialized and simplified form that suffices for our purposes: 

Lemma 1. Consider a Hilbert space M{ of dimension d\ G N, another Hilbert space M2 
of dimension d 2 G N and a subspace J^f R C <g> °f dimension d R . Let p R be l/d R 
times the projection to J4? R , and u R the uniform distribution on S(Jtf R ). Then for every 
r]>0, 



«4^W) : Pf-tr 2 p« >r / +^=l<4exp(-^ 
I tr yd R J V I87P 



(27) 



Here, the trace norm is defined by 



||M|| tr =tr \M\ = tr VM*M. (28) 

By the uniform distribution u R we mean the (2d R — l)-dimensional surface area measure 
on E>(Jif R ), normalized so that u R (E>(Jf R )) = 1. 

Theorem 3. For every < e < 1, < 8 < 1, d\ G N, every Hilbert space 3tf\ with 
dimJifi = d\, and every continuous function f : S(J^) — > M., there is a number D R = 
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D R (e, 5, d\, f)>0 such that for all d R , d 2 G N with d R > D R and d 2 > D 2 ( 2 y^ , 6/2, di) 
as in Theorem^ and for every ,W 2 an d J^r Q M{ ® J£| with dimJ^f 2 / R = d 2 / R , 

u R x u ONB { 6) £ S(J&) x ONB{J>%) : 

|//f 6 (/) - <L4P(tr 2 p*)(/)| < £ } > 1 - 5. (29) 

It follows that, for every sequence (^2, n ) ne N of Hilbert spaces with d 2>n = dim M'i^ — > 
oo as n — > oo, every sequence (^j? in )neN of subspaces of <g> with rf^ jn = 

dim J#R n — > oo as n — > oo, and every continuous function / : S(J^) — > R, the se- 
quence of random variables 

^• B -(/)_GAP(tr 2Pfl)n )(/), (30) 

where (\l/ n ,-B n ) has distribution u fln x uonb,ti on E>(J#R tn ) x ON B{J^ 2 ^ n ), converges to 
zero in distribution as n — > oo. We say that the sequence of random signed measures 
/if" ,B " — GAP(tr 2 pR <n ) converges "weakly in distribution" to zero. 

For < 7 < 1/ dim Jf? let ^> 7 (J^) denote the set of density matrices p G ${<ffl) 
whose eigenvalues are all greater than or equal to 7 (so that, in particular, zero is not 
an eigenvalue of p). 

Theorem 4. For every < e < 1, < 5 < 1, di 6 N, and < 7 < 1/di, there are 
numbers D' R = D' R (e, 5, d±, 7) > and r' = r'(e, d\, 7) > such that for all dR, d 2 G N 
with dR > D' R and d 2 > D 2 (e/2, 8/2, d\) as in Theorem^ for every Hilbert space M\ 
with dim^i = d\, for every Q G 3-> 1 {M\), for every and M'r C <g> with 
dim M'ljR = d 2 /R satisfying 

||tr 2 ( / 0fl)-fi|| tr <r', (31) 
and for every bounded measurable function f : S(J4f[) — >■ WL, 

u R x u ON b{ (V, 6) e x ONB(je 2 ) : 

(/) - GAP(n)(/)| < e H/IU} > 1 - 8. (32) 

If we want to consider just one particular density matrix Q (of which zero is not an 
eigenvalue) then we can set 7 equal to the smallest eigenvalue of Q. It then follows that, 
for every sequence (^2,n)neN of Hilbert spaces with d 2jTl = dim J^f 2n — > 00 as n — > 00, 
and every sequence (^R, n )neN of subspaces of <%{ ® ^2,n with d^ n = dim M'r^ — > 00 
and tr 2 pR, n — > ^ as n — > 00, the sequence of random measures p 1 n converges weakly 
in distribution to the fixed measure GAP(Q). In short, 

/if 6 UR m NB GAP(Q) . (33) 

Of the two theorems above, Theorem [3] is the simpler and perhaps more natural 
mathematical statement: it does not even mention any other density matrix than tr 2 pR, 
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its structure is to ask first that e, 5, and / be specified, which define the accuracy of 
the desired approximations^ and it applies to all subspaces J4?r of sufficient dimension. 
For the physical application, though, we often want to compare pf to GAP(Q) rather 
than GAP (tr2 Pr), for example because Q is the thermal density matrix pp = (l/Z)e~^ H 
while tr 2 pr is something complicated; we usually do not need that the estimate applies 
uniformly to all spaces M'r of sufficient dimension, but instead consider only one fixed 
Mr] and in that situation we can, in fact, obtain an estimate, the one provided by 
Theorem HI that is uniform in /. 

2.4 GAP Measure as the Thermal Equilibrium Distribution 

Theorem H] justifies regarding GAP(pp) as the thermal equilibrium distribution of the 
wave functions of the system 1 in the following way Let J#r be the microcanonical 
subspace, i.e., the spectral subspace of H associated with the interval [E, E + SE}. It 
is a standard fact (e.g., [6J, [12]) that when the interaction energy between system 1 and 
system 2 is sufficiently small, i.e., when we may set 

H = H 1 ®I 2 + h®H 2 (34) 

on J^totai — M\ <g> ffl 2 , and when system 2 is a large heat bath, i.e., when the eigenval- 
ues of H 2 are sufficiently dense, then tr 2 p^ is approximately of the exponential form 
Z^ 1 exp(— fiHi) with Z = tr exp(— (3 Hi) for suitable (3 > 0, i.e., is approximately the 
canonical density matrix pp. Then by Theorem H] in this special case of negligible inter- 
action we have that for most wave functions ip G §(J%), 

pf b » GAP(pp) (35) 

for most orthonormal bases b of J#2- 



3 Proofs 

3.1 Definition of u Pl 

According to the Schmidt decomposition [20], every ip e ^totai can be written in the 
form 

di 

i) = ^CiXi® (36) 

where {xi} is an orthonormal basis in J4f[, {(pi} is an orthonormal system in Jjf 2 (i-e., a 
set of orthonormal vectors that is not necessarily complete), and the q are coefficients 

3 How / defines a sense of accuracy becomes manifest if we consider finitely many test functions 
fx, . . . , ft, assume d,R > max(£>ij(/i), . . . , Dn^fy)), and then apply Theorem |3] to obtain that /if ' b and 
GAP(tr 2 pr) agree approximately on all linear combinations of /i, . . . , ft. 
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which can be chosen to be real and non- negative. If ||^|| = 1, the reduced density matrix 
of the system 1 is then 

Pt = ^\Xi)(Xi\- (37) 
1=1 

Thus, {xi} is an eigenbasis of pf, and are the corresponding eigenvalues. 

Now let a density matrix p 1 be given, let {xi} be an eigenbasis for pi, and let 
< Pi < 1 be the corresponding eigenvalues. Then every if) e &(pi) possesses a 
Schmidt decomposition of the form 

di 

ip = ^2VPiXi®<Pi (38) 

with some orthonormal system {0j} in J^. Indeed, we know it has a Schmidt de- 
composition fl36|) in which {xi} is an eigenbasis of pi, and cf are the eigenvalues. 
Reordering the terms in ( |36|) . we can make sure that q = y/pl. Any two eigen- 
bases {xi} and {xi} of pi are related by a block unitary; more precisely, for ev- 
ery eigenvalue p of pi, {x« : i G <^(p)} an d {x« : * £ ^(p)} (using the index set 
o^"(p) = {i '■ cf = p} = {i : pi = p}) are two orthonormal bases of the eigenspace of p, 
and thus related by a unitary matrix (Ujj^ij^jr^y. 

Xi= E } JO • (39) 

Setting 

& = E ( 4 °) 

we obtain fl38|) . and that {<f>i} is an orthonormal system. 

Conversely, every orthonormal system {(pi} in defines, by fl38l) . a. ip £ &(pi). 
Thus, f )38|) defines a bijection F pi) { Xj } : ONS(J%2, d\) — )■ ^(pi). The Haar measure on 
the unitary group of defines the uniform distribution on the set of orthonormal bases 
of J^2, of which the uniform distribution on ONS(J%2, di) is a marginal; let u pu ^ x .y be 
its image under F pu { Xi y. 

We note that u Pu { Xi y actually does not depend on the choice of the eigenbasis {Xi}- 
Indeed, if {xi} is any other eigenbasis of pi (without loss of generality numbered in such 
a way that the eigenvalue of \i is Pi) then, as explained above, it is related to {xi} by 
a block unitary d\ x d\ matrix U consisting of the blocks (U^). Let U be the matrix 

whose entries are the complex conjugates of the entries of U, and let U denote the action 
of U on ONS(J%>, d^ given by 

YJuOj-' 1 (41) 
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Then 

F P ,Ax,}=F PlA x,}^- (42) 
Since the Haar measure is invariant under left multiplication, its marginal on ONS(J4? 2 , di) 
is invariant under U. We thus define u Pl to be u pit { Xi y for any eigenbasis {Xi}- 

3.2 Proof of Theorem Q] 

We will obtain Theorem [1] from the corresponding statement about the Gaussian mea- 
sures G(pi): 

Lemma 2. For every 0<e<l, 0<5<1, and d\ G N, there is D 2 = D 2 (e, 5, d\) > 
such that for all d 2 G N with d 2 > D 2 , for every M{ and with dim^/ 2 — du2, for 
every orthonormal basis b = {bi, . . . , b d2 } of Jrf? 2 , for every p\ G and for every 

bounded measurable function f : — > WL, 

u Pl [i, G M{p x ) : \~pf\f) -G(px)(J)\ < ell/Moo} > 1 -8, (43) 

where pf is the distribution of x/d 2 (bj\'ip) G M{ (not normalized) with respect to the 
uniform distribution of J G {1, . . . , d 2 }. 

We now collect the tools needed for the proof of Lemma EJ We note that pf is the 
sum of d 2 delta measures with equal weights, 




located at the points 

Mj) = \/4<W> ■ (45) 

Let p\ G @ let {xi} be an eigenbasis of p\ with eigenvalues pi, and let ip G M(p\). 
According to ( 1381) . 

dl 

^i(j) = 5^CjV^2(^l0i)Xj (46) 

i=l 

with q = Now we regard ^ as random with distribution u pi ; then the are 

<i 2 random vectors, and pf is their empirical distribution. As noted above, {0j} is a 
random orthonormal system distributed according to a marginal of the Haar measure; 
in other words, the expansion coefficients (bj\<f>i) of 

^ = ^.|,^ (47) 
i=i 

form a di x d 2 matrix that arises as the first d\ rows of a Haar-distributed unitary d 2 x d 2 
matrix. 
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It is therefore interesting to us how a submatrix of a Haar-distributed unitary matrix 
is distributed. We cite the relevant result from Olshanskij [HI Lemma 5.3] and Collins [2J 
Chap. 4] (the corresponding fact for orthogonal matrices was established independently 
by Diaconis, Eaton, and Lauritzen |3J). To formulate this result, we use the following 
terminology. Recall that a complex Gaussian random variable is one whose real and 
imaginary parts are independent real Gaussian random variables with equal variances. 
The variation distance of two measures /i, v on the same cx-algebra stf is defined to be 

- u \\ = sup (fi(A) - v(A)) + sup (u(A) - fi(A)) . (48) 

In case \i and v possess densities / and g relative to some measure A on stf ', this coincides 
with the L 1 norm of / — g, 

llM-HI = / X(dx)\f(x)-g(x)\. (49) 

Lemma 3. For k,n e N with k < n, let the random matrix (Uij) be Haar(?7(n)) 
distributed, and let X be the upper left k x k submatrix multiplied by the normalization 
factor y/n, = y/nUij for 1 < i, j < k. Let G a random kxk matrix whose entries Gij 
are independent complex Gaussian random variables with mean and variance E|GV,-| 2 = 
1. Let Hk, n denote the distribution of X and that of G. Then converges, as 
n — > oo, to /ifc in the variation distance. In fact, as soon as n > 2k, /i^ n and fMj- possess 
densities fk, n and fk relative to the Lebesgue measure in C kxk given by 

det(/-— — )J (50) 

(where JVk,n is the appropriate normalization factor, ||^||oo = SU P l^^l/l^l; an d I 

t)ec fc \{o} 

denotes the kxk unit matrix) and 

fk{X) = - k e-* xx *, (51) 

and 

\\fk,n - fk\\mc k * k ) -> as n -> oo. (52) 

A random matrix such as G is y/k times what is sometimes called a "standard non- 
selfadjoint Gaussian matrix." We will use only the following consequence of Lemma |3j 
concerning the convergence of the upper left k x 2 entries of (Uij) to a matrix of inde- 
pendent complex Gaussian random variables: 

Corollary 1. For every < e < 1 and k £ N there is n = Uq(e, k) > such that for 
every n e N with n > n and every bounded measurable function g : C k — > K, 

Eg(^U 1± , . . . , y/nUki) -Eg(G n , . . . ,G kl ) < e WgW^ (53) 
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and 



E[g(y/nU u , y/nU kl )g(VnU 12 , . . . , VnU k2 )]- 

E[g(G u , • • • , G kl )g(G 12 , . . . , G k2 )] \<e\\gf OD , (54) 

where (Uij) is Haar distributed in U(n), and Gij (i = 1, . . . , k; j = 1, 2) are independent 
complex Gaussian random variables with mean and variance E\Gij\ 2 = 1. 

Proof. Choose no = n (e, k) > 2k so that, for all n > no, \\f k , n — fk\\L 1 (c kxk ) < e - This 
is possible by Lemma |3j Then ( 153|) and ( 154)) follow. □ 

Proof of Lemma Set 

D 2 (e,l,tZi) = maxfn Q,di),no(^,G?i), JpJ • (55) 

Let us first introduce some abbreviations and notation. We write n for d 2 and 
for d\. Let Gu, . . . , Gfci, G\ 2 , . . . , G&2 again be independent complex Gaussian random 
variables with mean and variance E|GV,-| 2 = 1. We use the basis {xi} to identify M{ 
with C k . Let / : C k — > R be a bounded measurable function, and let q > be as before 
(the square roots of the eigenvalues of pi). Set 

g(zt, ...,z k ) = /(ci-21, . . . , c fc 2: fc ) . (56) 

Then g, too, is measurable and bounded with bound \\g\\oo = H/Hoo- As we said above, 
the distribution of the (bj\4>i) is the same as that of Uij with j e {1, . . . , n}, that is, the 
first k rows of a Haar distributed unitary n x n matrix. We thus write instead of 
(bj\<j)i), and obtain 

1 n 

PtU) =■ Kf) = ~Y,9(V^U lv y^U kj ) . (57) 

U 3=1 

We write X for -y/n times the upper left k x n submatrix of (L^-), Xj for the j-th column 
of X, i.e., 

X j = (^U lj ,...,yfiiU kj ), (58) 
and Gj for (Gij, . . . , G k j) (i = 1,2). In this notation, 



1 

and 



n 



G(pi)(/) = E/CdGn, . . . , c fc G fcl ) = %(d) . (60) 
The expression to be estimated is 

m-G( Pl )(f~) , (6i) 
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for which we have by the triangle inequality that 



/}(/) - G( Pl )(f) < /}(/)- W) + Efl(f) - G( Pl )(f) 



(62) 



Note that the second contribution in (1621) is nonrandom. 
We first show that 

e 



Ejl(f) - G( Pl )(f) 
if n is sufficiently large: By floTJj) . 



< 



E 



1 n 
n L — ' 



(63) 



(64) 



3=1 



because each Xj has the same distribution — because the columns of (fy) are exchange- 
able due to the invariance of the Haar measure. By f )53|) in CorollarylU if n > n (e/2, k) 
then the absolute difference between f )60|) and ( |64"|) is less than §||g||oo, i.e., (163]) holds. 

Concerning the first contribution in ([6"2"j) . Chebyshev's inequality (see, e.g., [U p. 65]) 
asserts that 

4 



F(|M/)-E/2(/)|>|" ;; 



< 



£ 2 



var (/!(/)) 



(65) 



where P is the Haar measure for £/y and var(y) is the variance of the random variable 
y. Now Lemma [2] follows if we can show that 



,„,?,, fe 2 
var(/i(/)) < — 



2 

oc 



(66) 



for sufficiently large n. We find that 

1 n 

var(/i(/)) = E[/i(/) 2 ] - (E/2(/)) 2 = - £ E^X,)^)] - [Eg(Xi)] 2 ■ (67) 



Since the X,- are exchangeable, the joint distribution of Xj and Xjv for j ^ f is the 
same as the joint distribution of Xi and X 2 , so that all summands with j ^ f are equal 
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(and all summands with j = f are equal), and we can write 



var(ju(/)) 



n — n 



n 



E[g(X 1 )g(X 2 )] + - 2 E^X^-^X^ 

<\Vj\\L 



(61 



< E{g{X x )g{X 2 )\ - -E{g{X x )g{X 2 )\ - [%(^i)] 2 (69) 



>-||9l! 2 oo 



<E[g(X 1 )g(X 2 )] + -Ml -[Eg(X 1 )] 2 

= E[g(X 1 )g(X 2 )} -E [</(&) g{G 2 )] + [E^GO] 2 + 



(70) 



=o 



+ "NIL - [%M 



< 



< 



2 
oo 



E^)^)] -E[p(G!)p(G 2 )] 
+ [E^GO] 2 "^!)] 2 
E[<7(Xi)y(X 2 )] -E[ ff (G 1 ) 5 (G 2 )]| + ^y| 2 0O + 
- \Eg(G 1 )-Eg(X 1 )\ lEg^) +Eg(X 1 )\ 



<2||S||, 



~2 



;2 



v ,,oo + -r^lljglloo 211^1100 

16 n 16 

for n > rio(5e 2 /16, /c) by Corollary [TJ If in addition n > 32/5e 2 , we thus have that 



(71) 

(72) 
(73) 
(74) 



var(/i(/)) < 5e 



~2 II „||2 



1 1 l\_Se 
16 + 16 + 8/ ~4 



~2 



~2 



5e 
T 



(75) 



Thus, for d 2 = n > D 2 in ( 1551) . both ( 1631) and ( )66l) hold; Lemma |2] follows using f )62l 
and (E5D. □ 



We now collect the tools needed for the proof of Theorem [H 
Lemma 4. p${\\ ■ || 2 ) = 1. 

Proof. Recall that /if is the distribution of y/d^lpj^) with P(J = j) = l/d 2 . Thus, 



fiiid^mi 2 = E(d 2 \\(bjm 2 ) = y^ 



i . 



3=1 



(76) 
□ 



Lemma 5. = P^Afi^ 
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Proof. By definition, Ajif is the distribution of yfd^(b j\ip) with P(J = j) = IK^jl^)!! 2 , 
and P^Apf is the distribution of (bj\ip)/\\ (bj\ip) || with P(J = j) = ||(&j#)|| 2 . The latter 
is the definition of pf. □ 

Recall from (EES} that, for every p G ^(Jf ), 

C?(p)(dV) W 2 = l- (77) 

Lemma 6. For every < e < 1 and d G N, t/iere is R = R(e, d) > stzc/i i/iai, /or 
every toif/i dim^f 7 = d and ever?/ p G ^(J^ 7 ), 

/ G(p)(#)|M| 2 >l-e. (78) 

</{</>e,^:||V||<.R} 

Proof. Let Xi, . . . , be independent complex Gaussian random variables with mean 
and variance 1. Then X = (Xi, . . . , X^) has Gaussian distribution G(I) with covariance 
matrix /, the identity matrix; note that 

/ G(I)(diP)U\\ 2 = EJ2\Xi\ 2 = d. (79) 
Jje l=1 



Thus, there is R > with 



/ G{I){dip) ll^f > d-e. (80) 

For any M' with dim ffl = d and p G ^(J^ 7 ), choose an eigenbasis of p to identify Jrf? 
with C d , so p = diag(pi, . . . ,Pd). Set Zi = .JplX^ so Z = {Z\, . . . , Z d ) has distribution 
G(p). Then 



•%ejr : |MI>fl} x 



= E(l EPj | Xj | 2 > fl2 ^p 4 |X J | 2 ) (82) 

i 

<E(l E |x f |^El^! 2 ) ( 83 ) 

i 

G(/)(#)|M| 2 <e. (84) 

{^ejr:||^||>-R} 

□ 

As an abbreviation, we write M(J%x,J%2,pi,b,f,e) or shorter M(f,e) for the set 
considered in ( 125]) . and M(J^i, J^, p x , b, f,e) or shorter M(f,e) for the set considered 
in 
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Lemma 7. Fix < e < 1, J#[, M^, b, and P \. Let R = R(e/4,di) with R(-, ■) provided 
by Lemma\^ Then, for every bounded measurable function f : S(J4?i) — > K. ; 

M(f,e) D M((/ o P)1 n<r N 2 , JL) n m(i n<r N 2 , ^) , (85) 

where P denotes the projection to the unit sphere as defined in ( |22|) and N(ipi) = 
on M\. 

Note that while /oPis not defined at G Jfi, (/ o P)N 2 is— as 0. 

Proof of Lemma\2^ If we want to show of a particular ip G &{ P i) that ^ G M(/, e) then, 
by Lemma El we need to show that 

\P*A P f\f)-GAP{ Pl ){f)\ <e\\f\U. (86) 

We note that, for any probability measure P on M\ with jl(N 2 ) = 1 (in particular, by 
Lemma HI for fi = jif' b ), writing g = f o P, we obtain that 

\PJ&{f) - GAP( Pl )(f)\ = \ P (gN 2 ) - G( Pl )(gN 2 )\ < (87) 
< jl(gl N<R N 2 ) -G( Pl )(gl N<R N 2 )\ + (88) 

+ \jl(gl N > R N 2 )\+ (89) 

+ \G( Pl )(gl N > R N 2 )\. (90) 

Now suppose ip G M^glN <R N 2 , e/AR 2 ^ , which means that ^ G ^(pi) and 

/^r^WiV 2 ) -G( Pl )(^WiV 2 )| < ^ \\gl N <RN 2 \\ 00 . (91) 
Since gl N<R N 2 is a bounded function with 

llsWA^iL^imioc, (92) 

it follows from f l9~Tj) that for /} = /if' 6 the term (I8"51) is less than ||/||oo 
The term (189]) is less than or equal to 

»/i(l W ># 2 ) (93) 



, {jl(N 2 ) - M W^ 2 )) (94) 

,(l-/2(WiV 2 )) (95) 

< H/IU (l - G( Pl )( 1 N<R N 2 ) + \ P (1 N<R N 2 ) - G( Pl )( WiV 2 )|) , (96) 

where we have used Lemma HI By Lemma El 

G( Pl )(l N<R N 2 )>l-e/4, (97) 
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so the term (1591 is less than or equal to 

ll/IU (e/4 + |MWiV 2 ) -G( Pl )(WiV 2 )|) . (9* 

Now suppose that also ^ e Ml l^^N 2 , e/AR 2 ) , which means that ■?/> G &(pi) and 



| P f b (WiV 2 ) - G( Pl )(WiV 2 )| < ^ || W^IU = I • (99) 

It follows from Q2g) and ([99]) that for /2 = /if' 6 , the term ([89} is less than Il/H^ £ /2. 
The term (I9"U|) is less than or equal to 

G( Pl )(WiV 2 ) (100) 
(l-G( Pl )(WiV 2 )) (101) 



<I|/|U|, (102) 



where we have used G(pi)(N 2 ) = 1 and ([9] 

Putting together the above bounds for the terms (1581) . (1591 . and (19*01) . we obtain 
that their sum is bounded by £||/||oo(z + § + z) — ^ll/lloo, so (155]) is satisfied and 
^M(/,j). ' □ 

Proof of TheoremUi Suppose we are given 0<e<l,0<5<l, and di G N. Set 

D 2 = £> 2 (e, 5, d0 = D 2 -, di) (103) 

with D 2 as provided by Lemma [2] and i? as in Lemma [71 Now consider any d 2 6 N 
with <i 2 > D 2 , any j£[ and ^2 with dim = di/ 2 , any orthonormal basis b of J^2, 
any Pl G and any bounded measurable function / : S(J^) — >■ R. Then, by two 

applications of Lemma "21 

u Pi (m( 9 WJV 2 ,^)) > 1-5/2, (104) 

u Pi [m{i n<r N 2 ^)) > 1-5/2. (105) 

Therefore, u pi of the intersection of these sets is greater than or equal to 1 — 5. By 
Lemma [7J this intersection is contained in M(f,s). Thus, u pi (M(f,e)) > 1 — 8, which 
is what we wanted to show. □ 



3.3 Proof of Theorem H 

Proof. Note that for any unitary U on ^f 2 

(U- l b j \^) = {b j \I x ®Uil)). (106) 

From this fact and the fact that the Haar measure is invariant under U h-> U^ 1 it 
follows that the distribution of pf ,b , when ip G ^( P i) is -u Pl -distributed and b is fixed, 
is the same as when b is MoArs-distributed and ip G ^( P i) is fixed. Thus, Theorem ~2] is 
equivalent to Theorem [TJ (It also follows that the distribution of pf' b , when ip G <5?( P i) 
is u pi -distributed and b is fixed, does not depend on b.) □ 
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3.4 Continuity of GAP 

For the proofs of Theorems [3] and HI we will exploit canonical typicality, i.e., the fact that 
for most ip G §(J%r), the reduced density matrix pf is close to tr 2 p#. Theorems [3] and 0] 
then follow from Theorem [2] via suitable continuity of the mapping p i— > GAP(p). The 
following two lemmas provide somewhat different statements about continuity: Recall 
that ^> 7 (J^) is the set of density matrices with all eigenvalues greater than or equal to 
7. Lemma asserts that GAP(p)(f) depends in a uniformly continuous way on both p 
and / when we restrict p to S> 1 {,^f) for arbitrarily small 7 > 0; continuity is not uniform 
without this restriction. However, Lemma [8] asserts that for any fixed and continuous 
test function /, continuity is uniform in p without restrictions. 

Lemma 8. For every < e < 1, every deN, every Hilbert space M 3 with dimJtf? = d, 
and every continuous function f : S(Jtf) —> R there is r = r(e, d, f) > such that for 
all p,tte 9{3tf), 

if \\p - tt\\ tI < r then \GAP(p)(f) - GAP(tt)(f)\ <e. (107) 

While all norms on ^(J^) are equivalent for dim Jft? < 00, we use the trace norm 
|| • || tr here because in this norm the continuity extends to dim = 00 and because it 
is used in Lemma [TJ 

To formulate the other continuity statement, let ■Us(jr) denote the normalized uniform 
measure on the unit sphere in Jtf. For any density matrix p G @(J(?) of which zero is 
not an eigenvalue, GAP(p) possesses a density relative to u§tjg>) [S]. 

Lemma 9. For every < e < 1, every deN, every Hilbert space 34? with dimJt? = d, 
and every < 7 < 1/d, there is r = r(e, d, 7) > such that for all p, ft G ^> 7 (^) ; 



if \\p — Q\\tr < t then 



dGAP(p) dGAP(tt) 



<e. (108) 



du§(j?) dus(je) 
As a consequence, for such p and Q, 

\GAP(p)(f) - GAP(n)(f)\ < s H/Ui (109) 

for every f G L 1 (S(Jf), us(jr)) ■ 

It follows in particular that for any fixed density matrix Q of which zero is not 
an eigenvalue and any sequence (p n ) of density matrices with p n — >■ Q, the density of 
GAP(p n ) converges to that of GAP(Q) in the || • ||oo norm: Take 7 > to be less than 
the smallest eigenvalue of Q and note that only finitely many p n can lie outside £^> 7 (J£°). 

To see that in Lemma|9]^> 1 (Jf) cannot be replaced by &{Jf) (i.e., that continuity 
is not uniform without restrictions), note that, when is an eigenvalue of Q, GAP(Q) 
does not have a density with respect to Ws(jT)> so that at such an O, p i-> GAP(p) is 
certainly not continuous in L°° (S(Jff), m§(,x j )) or in the variation distance ()48l) . 

To see that in Lemma M one cannot drop the assumption that / is continuous, 
consider an Q that has zero as an eigenvalue and a p that does not. Then GAP(Q) is 
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concentrated on a subspace of dimension less than d while GAP(p) has a density on 
the sphere and lies near (rather than in) that subspace. Thus, for a test function / 
that is bounded measurable but not continuous, GAP(p)(f) does not have to be close 
to GAP(n)(f). 

As part of the proof of Lemma [BJ we will need the continuity property of Gaussian 
measures expressed in the next lemma. When p n , p are measures on a topological space 
X, we write p n =>- p to denote that the sequence of measures p n converges weakly to p. 
This means that p n (f) —> p(f) for every bounded continuous function / : X — > R and 
implies that the same thing is true for every bounded measurable function / : X — > R 
such that p(D(f)) = 0, where D(f) is the set of discontinuities of /. 

Lemma 10. The mapping p i— > G(p) is continuous in the weak topology on measures: 
If p n G £$(C d ) for every n G N and p n — >■ p then G(p n ) =>■ G(p). 

Proof. We use characteristic functions; as usual, the characteristic function p : R 2d — > C 
of a probability measure p on R 2d is defined by 




(110) 



or, in our notation on Jif = C d , 

A(0) = J p(d*P) exp(zRe(</#>) , (111) 

where Re denotes the real part. We write p n = G(p n ) and p = G(p); their characteristic 
functions are: 

An(V0 = exp(-(^|p n |</>» , = exp(-<V|p|V)) • (H2) 

If p n — > p then (ip\p n \ip) — > (ip\p\ip) for every ip and thus /t n — >■ p pointwise. Since 
(e.g., [1]) pointwise convergence of the characteristic functions is equivalent (in finite 
dimension) to weak convergence of the associated measures, it follows that G(p n ) =>• 
G(p), which is what we wanted to show. □ 

Proof of Lemma\^ Since &{J^) is compact, uniform continuity follows from continuity. 
That is, it suffices to show that, assuming p n G @{M > ) for every n G N, 

if p n -> p then GAP(p n ) =4> GAP(p) . (113) 

This follows from Lemma [101 the continuity of the adjustment mapping A defined in 
( |2~TT) in Section [L4l and the continuity of the projection P : Jtff\{0} — >■ S(J^). Our first 
step is to establish the continuity of A on the set of probability measures p on Jif such 
that f p(dip) \\4>\\ 2 = L If, for every n 6 N, /i„ is a probability measure on the Borel 
a-algebra of .ffl such that f p n (dip) HV'II 2 = 1, then 

if p n =^ p and / p(dijj) \\i/j\\ 2 = 1 then Ap n =^ Ap . (114) 
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Fix e > and an arbitrary non-zero, bounded, continuous function / : Jff — > R. As 
before, we use the notation N(ip) = Since, by hypothesis, p(N 2 ) = 1, there exists 
R > so large that 



M#) \\W > 1 - ^rjrr- ■ (H5) 



{ipeJt:\\ip\\<R} 6 

Let the "cut-off function" xo '■ [0, oo) — > [0, 1] be any continuous function such that 
Xo(%) = 1 for x < R and Xo( x ) = for x > 2R; set xijP) — Xo(HV'll)- Because 
xN 2 and fxN 2 are bounded continuous functions, and because p n =>- p, we have that 
UnixN 2 ) — > p{xN 2 ) and p n (fxN 2 ) — > p(fxN 2 ); that is, there is an n x G N such that, 
for all n > ni, 

\p n ( X N 2 ) - p( X N 2 )\ < — ^- (116) 



3 



and 



|/in(/xiV 2 )-M/xAT 2 )| <|. (117) 



Thus, for all ri > ri\, we have that 

\Afi n (f) - Ap(f)\ = \^ n (fN 2 )-p(fN 2 )\ (118) 

< \fi n (f X N 2 ) -p(f X N 2 )\ + \fi n {f(l- X )N 2 )\ + - X )iV 2 )| (119) 

< | + H/HooM»((l " X)^ 2 ) + ||/||oo^((l - X)N 2 ) (120) 

= | + ll/IU(i - UxN 2 )) + 11/1100(1 - Mxiv 2 )) (121) 

< I + 211/1100(1 - p( X N 2 )) + ll/HooMxiV 2 ) - /i( X iV 2 )| (122) 

<£ + £ + £=e. (123) 

- 3 3 3 v 7 

This proves fflTD Fl 

We are now ready to establish (11131) . Suppose p n — > p. We have that GAP(p„) = 

P*A(G(p n )) and that (AG(p)) (0) = 0. Since ip ^ Pip is continuous for ^ ^ 0, (fTTHj) 

follows from ( |114p and Lemma [TU1 This completes the proof of Lemma [HI □ 



Proof of Lemma^ We first note that, for any self-adjoint dxd matrix A and ip G §(C d ), 



\A\ 



< \\A\\ < 



tr 



(124) 



4 We remark that the hypothesis f n(dip)\\ ip\\ 2 = 1 cannot be dropped, that is, does not follow from 
f /x„(d^)||-0|| 2 = 1. An example is \i n = (1 — \ / ti)5q + {1 / n)8^ n , where <5^ means the Dirac delta measure 



at (f> and ip n is any vector with HVViH 2 
but fi n <5o, which has / <5o(cfo/')IIV'l| 2 = 



then 



is a probability measure with f fj, n (dtp)\\i()\\ 



1 



0. 
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For any density matrix p G 3>(J?l?) of which zero is not an eigenvalue, the density of 
GAP(p) relative to Ms(jn is given by jS] 



dGAP(p) 



Ti d det p 

o 

2vr d det p 



drr 2d 1 r 2 exp(—r 2 (i^\p 



IP 



-d-l 



(125) 



(126) 



Using the last expression, we will now show that fllOSp holds when p is sufficiently 
close to Q. This follows from the facts (i) that, on f^> 7 (^), the functions p t- >■ 1/ det p 
and p i — ^ p ^ are uniformly continuous, (ii) that 



IP 



-ii 



-ii 



< Hp 



-i 



-ii 



tr 



(127) 



for all ^> G §(J£ 9 ), (iii) that the function x \- > x^^ 1 is uniformly continuous on the 
interval [l,oo), and (iv) that (tp\p~ l \tp) > 1, (^|f2 -1 |?/>) > 1. This establishes the 
existence of r(e, d, 7) > as described in Lemma |9j 
Now (11091) follows from (I108j) according to 

|GAP(p)(/)-G?i4P(0)(/)| 



S(Jf) 



( dGAPjpl dGAPjtl) , , 
du so*) ( — (V) - — 7: W ) /W 



dGAP(p) 



dm 



(V>) 



du§(jf) 
dGAP(Q) 



duf. 



(V>) 



l/(VOI<* 



1 • 



'1281 



(129) 



□ 



3.5 Proof of Theorem |3] 

Proof of Theorem^ Suppose we are given < e < 1, < 5 < 1, d± G N, a Hilbert 
space ^ of dimension d\, and a continuous function / : S(J^) — >■ K. Set 



= D R (e,S, dt,f) 



r(e/2,d h fy 



maxfrf^lSvr 3 log(8/<J) 



(130) 



with r(e, cf, /) as provided by Lemma El Now consider any c£r, g?2 G N with d R > Dr 
and d 2 > Z?2(e/2||/||oo, S/2, dx), any ,Jif 2 and C M{ <8> ^2 with dim J^/r = 
Let M(f,s) be the set mentioned in (129|) . 



let 



M(/,e) = {(^,6) e S(J^) x ONB(S%) : |pf fc (/) - GAP(tr 2 p*)(/)| < ^} , (131) 
M'(/,e) = {(V,&) G S(^) x OiVE^) : |pf fe (/) - G<AP(pf )(/) | < e] (132) 
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and 

M"{e) = [i> G S(J^) : \\pf -tr 2 p R \\ tI < e}. (133) 

Then, by Lemma [HI 

M(/, £ ) D M'(/,|) n [M"^^,^,/)) x ONB(J^)] . (134) 
Theorem [2] yields, using our assumption d 2 > D 2 (e/2||/|| 00 , 5/2, di), that for every 

u ONB {b E ONB(J^ 2 ) : \4> b (f) - GAP(pf)(f)\ < £} > 1 - 5/2 . (135) 

Thus, averaging over ip G S(J4?r) according to Ur, 

u r x uonb (M'(/, e/2)) > 1 - <J/2 . (136) 

Lemma[T]with rj = r/2 for r = r(e/2, di, f) yields, using our assumption g?r > 4d^/r 2 , 
which implies that di/yd~R < r/2, that 

n fl (Af"(r))>l-4exp(-^). (137) 

Using our assumption > 187r 3 41og(8/5)/r 2 , the right hand side is greater than or 
equal to 1 — 5/2, and thus 



ur x uqnb 



M"(r) x OiV£(^) > 1-5/2. (138) 



From flT36]) . f lT38]) . and ffT34l together we have that 



ur x uqnb 



M(f,e) >l-5, (139) 



which is what we wanted to show. □ 
3.6 Proof of Theorem [4] 

Proof of Theorem^ Suppose we are given < e < 1, < 5 < 1, di G N, < 7 < l/d\, 
and a Hilbert space J^f of dimension di . Set 

= ^(5,5,^,7) = ^max(d?,187r 3 log(8/5)) , (140) 

r' = r\e,d l , 1 )= l -r{e/2,d l , 1 ), (141) 

with r(e,d, 7) as provided by Lemma IHJ Now consider any dR,d 2 G N with c(r > .D^. 
and d 2 > D 2 (e/2,5/2,d x ), any ft G ^> 7 (^i), any Jf 2 and Jf K C ® Jf 2 with 
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dimJ4? 2 /R = d 2 /R, and any bounded measurable function / : §(J&i) — > R. Let M (f,e) 
be the set mentioned in 



(142) 



M (f,e) = |(V,6) G S(J&) x OiVi?(^) : (/) - < £ 

let, as in the proof of Theorem [3], 

M\f,e) = {(^,b) eS(J^ R ) x ONB(J^ 2 ) : \fif\f) - GAP(pf)(f)\ < E } , (143) 

let 

<(e) = G S(J^) : - fi|| tr < e}, (144) 
and let, as in the proof of Theorem [31 

Af"(e) = G S(J^) : ||pf -tr 2P/? || tr < e}. (145) 

Now assume 1 1 tr 2 pj? — f2 1 1 < r' . Then 

M£(2r') 2 M"(r') (146) 
and, by Lemma[9]and ||/||i < ||/||oo, 

M (/, e) D M' (7, ^k^) n [K(2r') x ONB(Jf 2 )] . (147) 

As in the proof of Theorem El Theorem [2] yields (11361) with e replaced by e||/||oo 
using our assumption d 2 > D 2 (e/2,5/2,di), and Lemma [H yields (1138H with r replaced 
by r', using our assumption dn > D' R . From (I136p . fll38j) . (11461) . and (11471) together we 
have that 



u R x U NB 



M (f,e) 



>1-S, (1481 



which is what we wanted to show. □ 
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